Overview
Brought to you by YData
Dataset statistics
| Number of variables | 6 |
|---|---|
| Number of observations | 64829999 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.9 GiB |
| Average record size in memory | 96.9 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 4 |
id_categoria has constant value "3" | Constant |
liq_um is highly skewed (γ1 = 274.4975194) | Skewed |
liq_um has 1341925 (2.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-10-18 17:30:58.812138 |
|---|---|
| Analysis finished | 2025-10-18 17:38:15.697452 |
| Duration | 7 minutes and 16.89 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
id_categoria
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 GiB |
| 3 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 64829999 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 64829999 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 64829999 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 64829999 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 64829999 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 64829999 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 64829999 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 64829999 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 64829999 |
id_cliente
Real number (ℝ)
| Distinct | 131135 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 370578.5 |
| Minimum | 33 |
|---|---|
| Maximum | 727518 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 494.6 MiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 57426 |
| Q1 | 192577 |
| median | 366646 |
| Q3 | 550089 |
| 95-th percentile | 686441 |
| Maximum | 727518 |
| Range | 727485 |
| Interquartile range (IQR) | 357512 |
Descriptive statistics
| Standard deviation | 201342.97 |
|---|---|
| Coefficient of variation (CV) | 0.54332069 |
| Kurtosis | -1.1703441 |
| Mean | 370578.5 |
| Median Absolute Deviation (MAD) | 180198 |
| Skewness | -0.039920542 |
| Sum | 2.4024604 × 1013 |
| Variance | 4.053899 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 428303 | 4575 | < 0.1% |
| 385830 | 4530 | < 0.1% |
| 115924 | 4365 | < 0.1% |
| 449499 | 4305 | < 0.1% |
| 563742 | 4224 | < 0.1% |
| 99605 | 4213 | < 0.1% |
| 306941 | 4194 | < 0.1% |
| 382601 | 4145 | < 0.1% |
| 551634 | 4120 | < 0.1% |
| 452203 | 4068 | < 0.1% |
| Other values (131125) | 64787260 |
| Value | Count | Frequency (%) |
| 33 | 145 | < 0.1% |
| 43 | 544 | < 0.1% |
| 44 | 132 | < 0.1% |
| 47 | 94 | < 0.1% |
| 48 | 1717 | |
| 51 | 773 | |
| 56 | 122 | < 0.1% |
| 57 | 246 | < 0.1% |
| 60 | 91 | < 0.1% |
| 62 | 66 | < 0.1% |
| Value | Count | Frequency (%) |
| 727518 | 1096 | |
| 727517 | 79 | < 0.1% |
| 727515 | 228 | < 0.1% |
| 727514 | 3 | < 0.1% |
| 727512 | 1192 | |
| 727511 | 441 | < 0.1% |
| 727510 | 1194 | |
| 727509 | 71 | < 0.1% |
| 727507 | 623 | |
| 727505 | 81 | < 0.1% |
id_periodo
Real number (ℝ)
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202392.55 |
| Minimum | 202301 |
|---|---|
| Maximum | 202508 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 494.6 MiB |
Quantile statistics
| Minimum | 202301 |
|---|---|
| 5-th percentile | 202302 |
| Q1 | 202309 |
| median | 202404 |
| Q3 | 202412 |
| 95-th percentile | 202507 |
| Maximum | 202508 |
| Range | 207 |
| Interquartile range (IQR) | 103 |
Descriptive statistics
| Standard deviation | 77.05203 |
|---|---|
| Coefficient of variation (CV) | 0.00038070586 |
| Kurtosis | -1.320406 |
| Mean | 202392.55 |
| Median Absolute Deviation (MAD) | 95 |
| Skewness | 0.21909395 |
| Sum | 1.3121109 × 1013 |
| Variance | 5937.0153 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=32)
| Value | Count | Frequency (%) |
| 202412 | 2359733 | 3.6% |
| 202303 | 2278704 | 3.5% |
| 202312 | 2274404 | 3.5% |
| 202501 | 2255210 | 3.5% |
| 202401 | 2251301 | 3.5% |
| 202403 | 2228632 | 3.4% |
| 202402 | 2211062 | 3.4% |
| 202503 | 2168995 | 3.3% |
| 202411 | 2144991 | 3.3% |
| 202301 | 2142199 | 3.3% |
| Other values (22) | 42514768 |
| Value | Count | Frequency (%) |
| 202301 | 2142199 | |
| 202302 | 2135091 | |
| 202303 | 2278704 | |
| 202304 | 2047442 | |
| 202305 | 1977191 | |
| 202306 | 1785987 | |
| 202307 | 1794284 | |
| 202308 | 1997513 | |
| 202309 | 1981820 | |
| 202310 | 2040537 |
| Value | Count | Frequency (%) |
| 202508 | 1884650 | |
| 202507 | 1839339 | |
| 202506 | 1713365 | |
| 202505 | 1892298 | |
| 202504 | 1955745 | |
| 202503 | 2168995 | |
| 202502 | 2118941 | |
| 202501 | 2255210 | |
| 202412 | 2359733 | |
| 202411 | 2144991 |
tipo_mix
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 GiB |
| GASEOSAS | |
|---|---|
| MINERALES | |
| NECTAR | |
| FUNCIONAL | |
| TEA | 743111 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.930571 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NECTAR |
|---|---|
| 2nd row | NECTAR |
| 3rd row | NECTAR |
| 4th row | FUNCIONAL |
| 5th row | FUNCIONAL |
Common Values
| Value | Count | Frequency (%) |
| GASEOSAS | 30913090 | |
| MINERALES | 13661223 | |
| NECTAR | 11319776 | 17.5% |
| FUNCIONAL | 8192799 | 12.6% |
| TEA | 743111 | 1.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| gaseosas | 30913090 | |
| minerales | 13661223 | |
| nectar | 11319776 | 17.5% |
| funcional | 8192799 | 12.6% |
| tea | 743111 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 106400493 | |
| A | 95743089 | |
| E | 70298423 | |
| N | 41366597 | 8.0% |
| O | 39105889 | 7.6% |
| G | 30913090 | 6.0% |
| R | 24980999 | 4.9% |
| I | 21854022 | 4.3% |
| L | 21854022 | 4.3% |
| C | 19512575 | 3.8% |
| Other values (4) | 42109708 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 514138907 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 106400493 | |
| A | 95743089 | |
| E | 70298423 | |
| N | 41366597 | 8.0% |
| O | 39105889 | 7.6% |
| G | 30913090 | 6.0% |
| R | 24980999 | 4.9% |
| I | 21854022 | 4.3% |
| L | 21854022 | 4.3% |
| C | 19512575 | 3.8% |
| Other values (4) | 42109708 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 514138907 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 106400493 | |
| A | 95743089 | |
| E | 70298423 | |
| N | 41366597 | 8.0% |
| O | 39105889 | 7.6% |
| G | 30913090 | 6.0% |
| R | 24980999 | 4.9% |
| I | 21854022 | 4.3% |
| L | 21854022 | 4.3% |
| C | 19512575 | 3.8% |
| Other values (4) | 42109708 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 514138907 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 106400493 | |
| A | 95743089 | |
| E | 70298423 | |
| N | 41366597 | 8.0% |
| O | 39105889 | 7.6% |
| G | 30913090 | 6.0% |
| R | 24980999 | 4.9% |
| I | 21854022 | 4.3% |
| L | 21854022 | 4.3% |
| C | 19512575 | 3.8% |
| Other values (4) | 42109708 | 8.2% |
id_sku_venta
Real number (ℝ)
| Distinct | 459 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 443650.31 |
| Minimum | 810 |
|---|---|
| Maximum | 875220 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 494.6 MiB |
Quantile statistics
| Minimum | 810 |
|---|---|
| 5-th percentile | 957 |
| Q1 | 1657 |
| median | 870002 |
| Q3 | 870706 |
| 95-th percentile | 871275 |
| Maximum | 875220 |
| Range | 874410 |
| Interquartile range (IQR) | 869049 |
Descriptive statistics
| Standard deviation | 431072.97 |
|---|---|
| Coefficient of variation (CV) | 0.97165033 |
| Kurtosis | -1.9853483 |
| Mean | 443650.31 |
| Median Absolute Deviation (MAD) | 1503 |
| Skewness | -0.028943162 |
| Sum | 2.8761849 × 1013 |
| Variance | 1.858239 × 1011 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1600 | 1377920 | 2.1% |
| 4714 | 1232267 | 1.9% |
| 870203 | 1225221 | 1.9% |
| 1603 | 1179902 | 1.8% |
| 1756 | 1075198 | 1.7% |
| 870690 | 1058605 | 1.6% |
| 1602 | 1050387 | 1.6% |
| 1267 | 982919 | 1.5% |
| 1745 | 920687 | 1.4% |
| 4715 | 901034 | 1.4% |
| Other values (449) | 53825859 |
| Value | Count | Frequency (%) |
| 810 | 38312 | 0.1% |
| 816 | 45574 | 0.1% |
| 817 | 253331 | |
| 826 | 39020 | 0.1% |
| 829 | 244568 | |
| 833 | 453 | < 0.1% |
| 856 | 44272 | 0.1% |
| 859 | 285778 | |
| 905 | 223235 | |
| 909 | 117237 |
| Value | Count | Frequency (%) |
| 875220 | 5186 | < 0.1% |
| 875219 | 34260 | |
| 875218 | 4858 | < 0.1% |
| 875148 | 7158 | < 0.1% |
| 875147 | 7303 | < 0.1% |
| 875138 | 4104 | < 0.1% |
| 871569 | 3259 | < 0.1% |
| 871539 | 4003 | < 0.1% |
| 871537 | 26448 | |
| 871534 | 13963 |
liq_um
Real number (ℝ)
Skewed Zeros
| Distinct | 10924 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.28024086 |
| Minimum | -110.592 |
|---|---|
| Maximum | 2204.928 |
| Zeros | 1341925 |
| Zeros (%) | 2.1% |
| Negative | 4546 |
| Negative (%) | < 0.1% |
| Memory size | 494.6 MiB |
Quantile statistics
| Minimum | -110.592 |
|---|---|
| 5-th percentile | 0.024 |
| Q1 | 0.072 |
| median | 0.1152 |
| Q3 | 0.216 |
| 95-th percentile | 0.72 |
| Maximum | 2204.928 |
| Range | 2315.52 |
| Interquartile range (IQR) | 0.144 |
Descriptive statistics
| Standard deviation | 2.5545628 |
|---|---|
| Coefficient of variation (CV) | 9.115597 |
| Kurtosis | 147340.6 |
| Mean | 0.28024086 |
| Median Absolute Deviation (MAD) | 0.0576 |
| Skewness | 274.49752 |
| Sum | 18168015 |
| Variance | 6.5257909 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.144 | 6844704 | 10.6% |
| 0.072 | 5988861 | 9.2% |
| 0.096 | 5347015 | 8.2% |
| 0.048 | 4308539 | 6.6% |
| 0.0576 | 4259958 | 6.6% |
| 0.0768 | 3342929 | 5.2% |
| 0.288 | 3284268 | 5.1% |
| 0.12 | 2550515 | 3.9% |
| 0.192 | 2241163 | 3.5% |
| 0.024 | 1988927 | 3.1% |
| Other values (10914) | 24673120 |
| Value | Count | Frequency (%) |
| -110.592 | 1 | |
| -82.7904 | 1 | |
| -58.2912 | 1 | |
| -45.6192 | 1 | |
| -44.928 | 1 | |
| -41.6448 | 1 | |
| -31.584 | 1 | |
| -29.8944 | 1 | |
| -28.8 | 1 | |
| -27.648 | 1 |
| Value | Count | Frequency (%) |
| 2204.928 | 1 | |
| 2128.896 | 1 | |
| 2068.0704 | 1 | |
| 2054.592 | 1 | |
| 2021.76 | 2 | |
| 2005.632 | 1 | |
| 1868.1408 | 1 | |
| 1850.352 | 1 | |
| 1825.344 | 2 | |
| 1824.768 | 1 |
Interactions
Correlations
| id_cliente | id_periodo | id_sku_venta | liq_um | tipo_mix | |
|---|---|---|---|---|---|
| id_cliente | 1.000 | 0.026 | -0.001 | -0.009 | 0.011 |
| id_periodo | 0.026 | 1.000 | 0.086 | -0.016 | 0.020 |
| id_sku_venta | -0.001 | 0.086 | 1.000 | -0.185 | 0.426 |
| liq_um | -0.009 | -0.016 | -0.185 | 1.000 | 0.003 |
| tipo_mix | 0.011 | 0.020 | 0.426 | 0.003 | 1.000 |
Missing values
A simple visualization of nullity by column.